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(57) Abstract 

The invention relates to a method and an 
arrangement for the automatic data acquisition of 
paper-based information, whereby an unknown 
form (10) that is scanned into a computer is 
transformed into line elements (14, 15) for quicker 
identification. The identification of known forms 
(10) is also simplified by means of the line 
elements (14, 15). 
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Title 



Method and arrangement for automatic data acquisition of forms 



Technical field 



The present invention refers to a method and arrangement for the automatic data 



acquisition, by means of a means for the same, of forms whose design and information is not 
known in advance, by input into the said means together with storage of patterns of the same. 

The prior art 

It is a problem for companies, organisations and others to make good use of the 
information found in different types of paper forms, documents, etc. 

With new, modern technology, these items can be scanned with a scanner and 
entered into a database via commercially available software programs. However, sorting, 
identification and other checking routines must to a large extent still be performed manually via 
the computer's display or screen. 



specifically designed document with a logotype and other visual elements, it must be revised so 
that its format is adapted to one that can be accepted by the software and then stored in a 
database. This and other procedures must be repeated each time an invoice with a new design is 
scanned with the software. 



is often searched, which is time consuming. Certain software programs can have search routines 
that restrict the extent of this searching. It is, however, difficult to safeguard against blurred or 
hand-written lines of text, etc. 



able to identify these and/or quickly be able to enter and store new patterns in their invoicing 
system. 

Patent US-A-4 933 979 describes traditional data acquisition from forms and 
requires pre-defined templates/patterns with no self-learning (adaptive) ability. 

Patent US-A-5 140 650 mentions data acquisition from forms with what is known as 
"Form out'* technology to cover-over the original document and only retain the parts that are 
4< filled-in'\ This data acquisition is often combined with data acquisition according to US-A-4 
933 979. The patent does not have any adaptive function for data acquisition of unknown forms. 



For example, to store an invoice from one and the same company as one 



To identify an invoice from a company that is already registered, the whole invoice 



A need therefore exists for all who handle invoices and other forms to quickly be 



WO 98/47098 ^Pt/SE98/00602 

2 

Another patent, US-A-5 293 429, concerns the classification of documents with the 
help of lines on the documents and does not directly concern data acquisition or any adaptive 
function for this. US-A-5 293 429 does not ensure the identification of lines with object areas 
(areas with text) and a "RCG-value" (ReCoGnition, a number that uniquely identifies a 
5 document). 

None of the said patents generates a form map for a form unknown to the system 
according to the patent and stores the map in real time in a form database for recognition at the 
next opportunity for identification. For the inventions according to these patents, the unknown 
form must therefore be stored later by other means. 
10 Summary of the invention 

One of the objectives of the present invention is to solve the problems named above 
as well as others during what is known as automatic data acquisition (interpretation) in connection 
with the handling of paper-based information. 

The present invention concerns a system (method and arrangement) for the 
15 automatic data acquisition of forms where the system has no prior knowledge of what the form 
looks like or where on the form the information is to be found. In this way, templates of forms 
do not have to be defined in advance, but are instead registered as they are submitted to the 
system, i.e., in real time. 

To accomplish the above objectives, the present invention specifies a method and 
20 arrangement for the automatic data acquisition, by means of a means for the same, of forms 
whose design and information is not known in advance, by input into the said means together 
with storage of patterns of the same. The method is adaptive, by which it includes learning and 
registering of forms as patterns without filled-in text, and by it also including the following steps 
for accomplishing the adaptive registration: 
25 generation of a form map based on the previously unknown form's design for 

identifying information contained on the form; 

searching and comparing the form map with stored, registered maps in a means for 
storing form maps; 

storing generated form maps in the storage means when they do not coincide with a 
30 stored map according to pre-determined limits for agreement; 

indication of agreement according to the limits for agreement when agreement is 

found; and 
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continued data acquisition for identifying of the information content of the form. 

According to one embodiment of the present invention, the form map can consist of 
an object area list with objects contained in the form whereby the object comprises colours and/or 
wholly or partly of text. 

5 In an alternative embodiment, the form map constitutes a line map comprising 

objects in the form of coloured lines from the form. 

Horizontal lines in the line map are used to produce a horizontal key by dividing the 
form into a pre-determined number of horizontal segments along the y-axis in a cartographic 
system of co-ordinates, whereby each segment is equivalent to a position in the horizontal key. 
10 Vertical lines in the line map are used to produce a vertical key by dividing the 

form into a pre-determined number of vertical segments along the x-axis in a cartographic system 
of co-ordinates, whereby each segment is equivalent to a position in the vertical key. 

At least one line element that is included in a segment is marked in the equivalent 
key position, and segments that lack line elements remain unmarked in the equivalent key 
15 position. 

A horizontal key and/or a vertical key constitute a line key in the line map, whereby 
during the said searching, the line key generated is compared with line keys stored in the means 
for verifying agreement. 

The line keys are sorted in the storage means according to the number of markings. 
20 The object's horizontal position in the object area list is used to generate a 

horizontal key by dividing the form into a pre-determined number of horizontal segments along 
the y-axis in a cartographic system of co-ordinates, whereby each segment is equivalent to a 
position in the horizontal key. 

The object's vertical position in the object area list is used to produce a vertical key 
25 by dividing the form into a pre-determined number of vertical segments along the x-axis in a 
cartographic system of co-ordinates, whereby each segment is equivalent to a position in the 
vertical key. 

At least one object that is included in a segment is marked in the equivalent key 
position, and segments that lack objects remain unmarked in the equivalent key position. 
30 A horizontal key and/or a vertical key constitute an object key in the object area 

list, whereby during the said searching, the object key generated is compared with object keys 
stored in the means for verifying agreement. 
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The object keys are preferably sorted in the storage means according to the number 

of markings. 

Searching results in a pre-defined number of requested probable candidates for the 
currently searched form. 

If needed, an operator can support manually the whole or pans of the adaptive 
registration or identification of the new form or registered forms respectively if several 
alternative candidates are found as probabilities according to a factor of merit. 

Finally, the identity of the form is confirmed by the data acquisition of a RCG- 

value. 

Furthermore, the present invention specifies a arrangement for performing the 
above method. 

The arrangement carries out automated data acquisition, by means of a means for 
the same, of forms whose design and information is not known in advance, by input into the said 
arrangement together with storage of patterns of the same. It learns adaptively and registers the 
design of the form, and includes a computer with the following means for carrying out the 
adaptive registration: 

means for generation of a form map based on the previously unknown form's design 
for identifying information contained on the form; 

means for searching and comparing the form map with stored, registered maps in a 
means for storing form maps; 

means for storing generated form maps in the storage means when they do not 
coincide with a stored map according to pre-determined limits for agreement; 

means for indicating agreement according to the limits for agreement when 
agreement is found; and 

means for identification and continued data acquisition of the information content of 

the form. 

In addition, the arrangement can include or constitute that specified according to the 
above method of the present invention, which is further illustrated in the accompanying non- 
independent claims for the arrangement. 

Brief description of the drawings 

Further reference to the enclosed figures and associated text will give a clearer 
understanding of the present invention. 



0^ Qh/i 
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Fig. 1 shows schematically how a line pattern is accomplished from a scanned-in 

invoice. 

Fig. 2 shows schematically a flow-path for scanning, identifying, interpreting and 
validating a form according to the present invention. 

Detailed description of preferred embodiments 

In the continued description of the present invention, the forms are presented as 
invoices. However, the invention is not limited to invoice forms but also covers general 
documents containing text, figures, etc. as forms. Invoices are used here as an example of a form 
to exemplify the invention. 

Fig. 1 illustrates schematically one part of invoice 10 that is scanned in a computer 
and that is shown on the display. As is evident from invoice 10, it is unclear or blurred after the 
scanning or input. 

Invoice 10 consists partly of a logotype 12 and the vertical 14 and horizontal line 15 

elements. 

Note that even the logotype contains long black or varying degrees of shaded 
coloured line elements 16 that have been partly registered in a line map 18 according to the 
present invention, and that give an idea of what the original logotype 12 looked like, which 
simplifies identification when the invoice is an object to be identified as being as registered in a 
form map database. Coloured lines also include grey scales of black. 

The form map that in this case constitutes line map 18 has been filtered from other 
objects 19, such as whole or parts of text objects or coloured objects, plus even the said line 
elements that include colour, which cannot be reproduced here, but that can be included as many 
coloured fields on a form 10. 

An invoice 10 that is prepared according to the present invention, hereafter 
designated EH (Eyes & Hands), must be identified at an early stage. For successful 
identification, EH must have on a previous occasion, learned what the current invoice 10 looks 
like, which in simple terms means that information about the invoice is available in the form 
database in EH. 

By necessity, the identification must be quick and be able to be made in a database 
that holds a very large number of invoice identities 18. It is not uncommon for databases to 
contain more than 10,000 identities 18. 
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The method and arrangement that EH uses does not require that an invoice is 
always put through a scanner in exactly the same way, i.e. the information on the invoice can 
vary somewhat in the x and y axes within a pre-determined measurement or threshold value. Fig. 
1 shows a schematic cartographic system of co-ordinates. 

In the present invention, (EH) comprises that in one embodiment, EH searches for 
all vertical 14 and horizontal line 15 elements of a pre-determined length on the invoice. Lines 
14, 15 do not need to be free-standing and isolated, but can, for example, be part of a larger 
logotype text 12, such as ReadSoft AB in Fig. 1. The logotype 12 is represented as the line 
element 16 in line map 18. 

The horizontal lines 15 and the vertical lines 14 constitute the basis for generating a 
horizontal key (h-key) and a vertical key (v-key) respectively according to the following: 

* The invoice is divided into a large number of horizontal segments along the y-axis (not 
shown). Each segment is equivalent to one position in the h-key. If a certain segment 
includes one or more line elements 15, a mark or tag is placed in the equivalent key 
position. If not, an empty space, an inverted mark or anything else that differentiates itself 
from a mark is used instead. 

* A v-key for the vertical line elements 14 is generated in a similar manner along the x- 
axis. 

* The h and v-keys are given designations and together constitute a line key. Following 
this, a search is performed, which means that the current line key is compared with line 
keys for known invoices 10 that exist in EH's database. This comparison takes into 
account that individual lines or line elements 14, 15 can vary somewhat in position, plus 
that the total pattern of lines can be displaced somewhat according to suitable pre- 
determined values in the x and y directions, horizontally and vertically respectively. 

* The line keys in the database are sorted according to the number of markings (tags), 
which are used to make the searching effective. 

* The search results in a pre-determined number of probable candidates for the identity of 
the current invoice 10. All candidates are associated with a factor of merit or a probability 
that they are the current invoice 10. 

* The identity of the invoice is finally confirmed by carrying out an interpretation of that 
known as the RCG- value (RCG- ReCoGnition). The RCG-value is a value at a given 
position that is unique for a certain invoice/supplier or other form. Examples of such 
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values are bank giro numbers, post-office giro numbers, invoice numbers, total amounts, 
etc. 

The said segments can, for example, form checked patterns that are fine-screened to 
varying extents according to the relative need for rapid searching. 
5 The line keys can even be implemented on objects formed wholly or partly of text 

and colours. These are assigned line keys from an object area list that includes x and y-keys for 
the object. The object area list can, for example, consist of positions for certain selected objects. 
The principles for line maps stated above are even appropriate for objects other than line 
elements to accomplish identification of forms. 
10 If the line keys are not found in the database, this indicates that the invoice is not 

known, which results in the new line keys being stored in the database that, in this way, is 
updated in real time. 

If necessary, the operator can, via his computer, manually support the whole or part 
of the adaptive registration and/or identification of a new form or registered form respectively if 
15 several alternative candidates are presented as probable according to the factor of merit. 

In addition, the present invention includes an arrangement for performing the 
method according to the above. 

The arrangement performs the automatic data acquisition, by means of a means for 
the same, of forms whose design and information content is not known in advance, by input into 
20 the said means together with storage of patterns of the same. It registers in an adaptive manner 
and learns the design of forms, and includes a computer with the following means for 
accomplishing the adaptive registration: 

means for generating a form map based on the previously unknown form's design 
for identifying information contained on the form; 
25 means for searching and comparing the form map with stored, recognised maps in a 

means for storing form maps; 

means for storage of generated form maps in the storage means when they do not 
coincide with a stored map according to pre-determined limits for agreement; 

means for indicating agreement according to the limits for agreement when 
30 agreement is found; and 

means for identification and continued data acquisition of the information content of 

the form. 
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The said means are preferably controlled by computer hardware and software, such 
as, for example: 

A scanner for acquisition of data. 

An electronic storage medium (hard disk, CD-ROM, etc.) for the means to store 
information 

Signs, icons, signal generators, etc. for indicating purposes. 

Filters and comparitors so that the means can search and compare, as well as filters and 
registers for identification. 

On the whole, the means used in the present invention are well known to a skilled 
person in the technical field, but the way in which they are co-ordinated to achieve the object of 
the invention is, however, innovative. 

In one embodiment of the present invention with reference to Fig. 2, a schematic 
flow-path is illustrated to show the scanning, identification, interpretation, and validation of a 
form according to the present invention. 

Fig. 2 is divided by dotted lines into partial areas to clarify the different steps in a 
method according to the invention, whereby the steps constitute the scanning of the form 200, 
identification of the form 210, interpretation of the form 220, plus validating the form 230. 

The form is scanned 200 into EH, and identification 210 follows. Identification 
consists of generating a line map 212, or alternatively an object area list, whereby an line key is 
generated. Following this, form 10 is compared 214 with known keys in the form map database, 
whereby a conformation of identification is obtained via the RCG-value. The next step includes 
deciding whether the identification was successful 216 according to the conditions 44 Yes*' or 
"No". If the decision results in "No", a conditional investigation is made to see if there are 
more candidates in the form of line keys 218. In the answer here is "Yes", a loop in the form of 
214, 216 and 218 is performed until a successful identification is finally made, or until no further 
line key candidates are presented 218. 

In the case of a successful identification, interpretation 220 of the form then begins 
by interpreting with the help of the current form map 222, after which validation 230 or 
evaluation 232 of the fields of the form 10 takes place. As an option, the operator can assist with 
selection if several alternative fields are found 234. 

If the identification 210 is unsuccessful, and no further line keys are presented 218, 
interpretation 220 is performed in that self-learning with a form definition 224 is accomplished. 
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The form definition consists of a template or a set of rules that describes the common elements of 
a specific collection of forms, for example, Swedish invoices. Following this, the RCG-value is 
interpreted 226 and a decision is made 228 whether the current RCG-value can be found in the 
form database. If the answer is "Yes", a re-interpretation begins 229, followed by a continued 
5 interpretation 222 that leads to validation 232. 

If, on the other hand, the answer is "No", validation commences 230, 236, after 
which the form is saved in the form map database with the line key 238. Prior to steps 236, 238, 
the operator can, if several field alternatives are found, assist with the self-learning process. 
The embodiments of the present invention described above are possible 
10 embodiments, but are not intended to limit the invention to such, as further embodiments will be 
evident to a skilled person in the technical area via the drafts of the enclosed claims. 
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Claims 

1. Method for the automatic data acquisition (200), by means of a means for the 
same, of forms (10) whose design and information content (19) is not known in advance, by input 
into the said means, together with storage of patterns of the same, characterised in that 

5 the method is adaptive (224) and includes self-learning and registration of the design of forms, 
whereby it includes the following steps to accomplish the adaptive registration (238): 

generation of a form map (18) based on the previously unknown form's (10) design 
for identifying (210) information contained on the form; 

searching and comparing (210, 220) the form map (18) with stored, registered maps 
10 in a means for storing form maps; 

storage (238) of generated form maps (18) in the storage means when they do not 
coincide with a stored map according to pre-determined limits for agreement; 

indication of agreement according to the limits for agreement when agreement is 

found; and 

15 continued data acquisition (232) for identifying the information content of the form. 

2. Method according to claim 1, characterised in that the form map (18) 
consists of an object area list with objects (19) contained in the form. 

3* Method according to claim 2, characterised in that the object (19) 
comprises colours and/or wholly or partly of text. 
20 4. Method according to claim 1, characterised in that the form map (18) 

constitutes a line map comprising line elements (14,15) from the form (10). 

5. Method according to claim 4, characterised in that the horizontal lines 
(15) in the line map are used to generate a horizontal key by dividing the form into a pre- 
determined number of horizontal segments along the y-axis in a cartographic system of co- 

25 ordinates, whereby each segment is equivalent to a position in the horizontal key. 

6. Method according to claims 4 and 5, characterised in that the vertical 
lines (14) in the line map are used to generate a vertical key by dividing the form (10) into a pre- 
determined number of vertical segments along the x-axis in a cartographic system of co- 
ordinates, whereby each segment is equivalent to a position in the vertical key. 

30 7. Method according to claims 5 and 6, characterised in that at least one 

line element (14, 15) that is included in a segment is marked in the equivalent key position, and 
that segments that lack line elements remain unmarked in the equivalent key position. 
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8. Method according to claims 4-7, characterised in that the horizontal key 
and/or a vertical key constitute a line key in the line map (18), whereby, during the said 
searching, the line key generated is compared (214) with line keys stored in the storage means for 
verifying agreement. 

5 9. Method according to claim 8, characterised in that the line keys are 

sorted in the storage means according to the number of markings. 

10. Method according to claims 1 and 2, characterised in that the object's 
horizontal position in the object area list is used to generate a horizontal key by dividing the form 
into a pre-determined number of horizontal segments along the y-axis in a cartographic system of 

10 co-ordinates, whereby each segment is equivalent to a position in the horizontal key. 

1 1 . Method according to claims 1 and 2, characterised in that the object's 
vertical position in the object area list is used to generate a vertical key by dividing the form into 
a pre-determined number of vertical segments along the x-axis in a cartographic system of co- 
ordinates, whereby each segment is equivalent to a position in the vertical key. 

15 12. Method according to claims 10 and 11, characterised in that at least 

one object that is included in a segment is marked in the equivalent key position, and that 
segments that lack objects remain unmarked in the equivalent key position 

13. Method according to claims 10-12, characterised in that a horizontal 
key and/or a vertical key constitute an object key in the object area list, whereby, during the said 

20 searching, the object key generated is compared with object keys stored in the storage means for 
verifying agreement. 

14. Method according to claim 13, characterised in that the object keys are 
sorted in the storage means according to the number of markings. 

15. Method according to claims 1-14, characterised in that the searching 
25 (210, 218) results in a pre-defined number of requested probable candidates for the currently 

searched form. 

16. Method according to claim 15, characterised in that an operator can 
support manually (234, 239) the whole or parts of the adaptive registration or identification of the 
new form or registered forms respectively if several alternative candidates are found as 

30 probabilities according to a factor of merit. 

17. Method according to claims 1-16, characterised in that the identity of 
the form is confirmed by the data acquisition of a RCG-value (214). 
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18. Arrangement for the automatic data acquisition, by means of a means for the 
same, of forms (10) whose design and information content (19) is not known in advance, by input 
into the said means together with storage of patterns of the same, characterised in that it 
learns adaptively and registers the design of forms (10), and includes a computer with the 
following means for carrying out the adaptive registration (238): 

means for generating a form map (18) based on the previously unknown form's (10) 
design for identifying information contained on the form; 

means for searching and comparing the form map with stored, recognised maps in a 
means for storing form maps; 

means for storage of generated form maps (18) in the storage means when they do 
not coincide with a stored map according to pre-determined limits for agreement; 

means for indicating agreement according to the limits for agreement when 
agreement is found; and 

means for identification and continued data acquisition of the information content of 

the form. 

19. Arrangement according to claim 18 characterised in that the form map 

(18) consists of an object area list with objects (19) contained in the form (10). 

20. Arrangement according to claim 19, characterised in that the object 

(19) comprises colours and/or wholly or partly of text. 

21 . Arrangement according to claim 18, characterised in that the form map 
(18) constitutes a line map comprising line elements (14, 15) from the form. 

22. Arrangement according to claim 21, characterised in that the 
horizontal lines (15) in the line map are used to generate a horizontal key by dividing the form 
into a pre-determined number of horizontal segments along the y-axis in a cartographic system of 
co-ordinates, whereby each segment is equivalent to a position in the horizontal key. 

23. Arrangement according to claims 21 and 22, characterised in that the 
vertical lines (14) in the line map are used to generate a vertical key by dividing the form into a 
pre-determined number of vertical segments along the x-axis in a cartographic system of co- 
ordinates, whereby each segment is equivalent to a position in the vertical key. 

24. Arrangement according to claims 22 and 23, characterised in that at 
least one line element (14, 15) that is included in a segment is marked in the equivalent key 
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position, and that segments that lack line elements remain unmarked in the equivalent key 
position. 

25. Arrangement according to claims 22-24, characterised in that the 
horizontal key and/or a vertical key constitute a line key in the line map, whereby, during the 

5 said searching, the line key generated is compared with line keys stored in the storage means for 
verifying agreement. 

26. Arrangement according to claim 25, characterised in that the line keys 
are sorted in the storage means according to the number of markings. 

27. Arrangement according to claims 18 and 19, characterised in that the 
10 object's (19) horizontal position in the object area list is used to generate a horizontal key by 

dividing the form into a pre-determined number of horizontal segments along the y-axis in a 
cartographic system of co-ordinates, whereby each segment is equivalent to a position in the 
horizontal key. 

28. Arrangement according to claims 18 and 19, characterised in that the 
15 object's (19) vertical position in the object area list is used to generate a vertical key by dividing 

the form into a pre-determined number of vertical segments along the x-axis in a cartographic 
system of co-ordinates, whereby each segment is equivalent to a position in the vertical key. 

29. Arrangement according to claims 27 and 28, characterised in that at 
least one object that is included in a segment is marked in the equivalent key position, and that 

20 segments that lack objects remain unmarked in the equivalent key position. 

30. Arrangement according to claims 27-29, characterised in that a 
horizontal key and/or a vertical key constitute an object key in the object area list, whereby, 
during the said searching, the object key generated is compared with object keys stored in the 
storage means for verifying agreement. 

25 31 . Arrangement according to claim 30, characterised in that the object 

keys are sorted in the storage means according to the number of markings. 

32. Arrangement according to claims 18-31, characterised in that the 
searching results in a pre-defined number of requested probable candidates for the currently 
searched form. 

30 33. Arrangement according to claim 32, characterised in that an operator 

can support manually the whole or parts of the adaptive registration or identification of the new 
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form or registered forms respectively if several alternative candidates are found as probabilities 
according to a factor of merit. 

34. Arrangement according to claims 18-33, characterised in that the 
identity of the form is confirmed by the data acquisition of a RCG-value. 
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Forfarande och anordning for automatisk datafingst hos formular 

Tekniskt omr&de 

5 Foreliggande uppfinning hanfor sig till ett forfarande och en anordning for 

automatisk datafingst, via organ for detsamma, av formular med pi forhand okand 
utformning och informationsinnehill, vid inmatning i namnda organ, samt lagring av 
forlagor av densamma. 

Teknikens st&ndpunkt 

10 Det innebar problem for foretag, organisationer och andra, att ta vara pi informa- 

tion som forekommer pi olika typer av formular, blanketter, dokument etc. 

Med ny och modern teknik avsoks sidana med en skanner och skannas in i en 
databas via pi marknaden forekommande datorprogram. Sortering, identifiering och andra 
kontrollrutiner miste dock i hog grad skotas manuellt interaktivt via datorns display eller 

15 skarm. 

For att t ex lagra en faktura, som kan forekomma som ett, for ett och samma 
foretag, specifikt utformat dokument med logotyp och andra fortecken, miste den 
omarbetas sk att dess format anpassas till vad programvaran accepterar och sedan lagras 
i en databas. Varje g&ng en faktura med ny utformning skannas in i programvaran miste v 

20 dessa och andra procedurer upprepas. 

For att identifiera en faktura frin ett foretag som redan finns registrerat soks ofta 
pi hela fakturans utformning, vilket ar tidskravande. Vissa programvaror kan ha sokruti- 
ner som begransar dess omfattning. Det ar dock svirt att gardera mot bluniga text- 
strangar, handskrivna sddana etc. 

25 Det existerar darfor ett behov, hos alia som hanterar fakturor eller andra blanket- 

ter, att snabbt kunna identifiera dessa och/eller att snabbt kunna infora och lagra nya 
forlagor i sitt faktureringssystem. 

Patentet US-A-4 933 979 beskriver traditionell datafingst Mn formular och 
kraver fordefinierade mallar/monster utan n&gon sjalvlarande (adaptiv) formSga. 

30 I patentet US-A-5 140 650 anges datafingst friLn formular med s k "Form out" 

teknik for att maska bort ursprungsdokumentet och endast behilla de "ifyllda delarna", 
vilken datafingst ofta kombineras med dataf&ngsten enligt US-A-4 933 979. Patentet 
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uppvisar inte nigon adaptiv funktion for datafingst av okanda formular. 

Ett annat patent US-A-5 293 429 avser klassificering av dokument med hjalp av 
linjer pi dokumenten och beror inte direkt datafingst eller nigon adaptiv funktion for 
detta. US-A-5 293 429 sakerstaller inte identifiering av linjer med objektomriden 
5 (omriden med text) och ett "RCG-varde" (ReCoGnition, nummer som unikt identifierar 
ett dokument). 

Inget av har namnda patentdokument alstrar en formularkarta av ett for systemen 
enligt patenten okant formular och lagrar kartan i realtid si att den ingir i en formularda- 
tabas for igenkannande vid nasta tillfalle for identifiering. For uppfinningarna enligt 
10 patenten miste siledes dokument som ar okanda lagras i efterhand med separata metoder. 

Sammanfattning av uppfinningen 
Ett av andamilen med foreliggande uppfmning ar att losa ovan namnda och andra 
problem vid s k automatisk datafingst (tolkning) i samband med hantering av pappers- 
buren information. 

15 Foreliggande uppfinning avser ett system (forfarande och en anordning) for auto- 

matisk datafingst pi formular dar systemet inte frin borjan kanner till hur ett formular ser 
ut och var pi formularet information iterfinnes. Pi si vis behover inte formularmallar 
definieras i forvag, utan dessa registreras allt eftersom de inkommer i systemet, d v s i 
realtid. 

20 For att istadkomma ovanstiende anger foreliggande uppfinning ett forfarande for 

automatisk datafingst, via organ for detsamma, av formular med pi forhand okand 
utformning och informationsinnehill, vid inmatning i namnda organ, samt lagring av 
forlagor av densamma. Forfarandet ar adaptivt, varvid det innefattar inlarning och 
registrering av utformningen av formular som forlagor utan ifylld text, och varvid det 
25 vidare innefattar foljande steg for att istadkomma den adaptiva registreringen: 

alstring av en formularkarta utifrin det pi forhand okanda formularets utformning 
for identifiering av formular innehillande information; 

sokning med jamforelse av formularkartan med lagrade inlarda kartor i ett organ 
for lagring av formularkartor; 
30 lagring av alstrad formularkarta i organet for lagring di den ej overensstammer 

med en lagrad karta enligt forutbestamda granser for overensstammelse; 

indikering av overensstammelse enligt granserna for overensstammelse di 
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overensstammelse foreligger; och 

fortsatt dataftngst for identifiering av formularets informationsinneh&ll. 

Formularkartan kan i en utforingsform av fdreliggande uppfinning best£ av en 
objektomrideslista med objekt innefattade i formular, varvid objekten bestir av farger 
5 och/eller hel eller del av text, 

I en alternativ utforingsform utgor formularkartan en linjekarta innefattande 
objekt i form av fargade linjeelement ur formularet. ' 

Horisontella linjer i linjekartan anvands for alstring av en horisontell nyckel 
genom att formularet indelas i ett forutbestamt antal horisontella segment langs y-axeln i 
10 ett kartesiskt koordinatsystem, varvid varje segment motsvarar en position i den horison- 
tella nyckeln. 

Vertikala linjer i linjekartan anvands for alstring av en vertikal nyckel genom att 
formularet indelas i ett forutbestamt antal vertikala segment langs x-axeln i ett kartesiskt 
koordinatsystem, varvid varje segment motsvarar en position i den vertikala nyckeln. 
15 Minst ett linjeelement som innefattas i ett segment marks i motsvarande nyckel- 

position och segment vilka saknar linjeelement forblir omarkta i motsvarande nyckelpo- 
sition. 

En horisontell nyckel och/eller en vertikal nyckel utgor en linjenyckel i lin- 
jekartan, varvid, vid namnda sokning, alstrad linjenyckel jamfors med linjenycklar som * 
20 finns lagrade i organet for lagring for kontroll av overensstammelse. 

Linjenycklarna ar sorterade efter antalet markningar i organet for lagring. 

Objekts horisontella position i objektomr&deslistan anvands for alstring av en 
horisontell nyckel genom att formularet indelas i ett forutbestamt antal horisontella 
segment langs y-axeln i ett kartesiskt koordinatsystem, varvid varje segment motsvarar en 
25 position i den horisontella nyckeln. 

Objekts vertikala position i objektomr&deslistan anvands for alstring av en vertikal 
nyckel genom att formularet indelas i ett forutbestamt antal vertikala segment langs x- 
axeln i ett kartesiskt koordinatsystem, varvid varje segment motsvarar en position i den 
vertikala nyckeln. 

30 Minst ett objekt som innefattas i ett segment marks i motsvarande nyckelposition 

och segment vilka saknar objekt forblir omarkta i motsvarande nyckelposition. 

En horisontell nyckel och/eller en vertikal nyckel utgor en objektnyckel i objekt- 
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omrideslistan, varvid, vid namnda sokning, alstrad objektnyckel jamfors med objekt- 
nycklar som finns lagrade i organet for lagring for kontroil av overensstammelse. 

Objektnycklarna ar foretradesvis sorterade efter antalet markningar i organet for 

lagring. 

5 Sokningen istadkommes att resultera i ett antal p& forhand begarda troliga 

kandidater for aktuellt sokt formular. 

En operator kan vid behov manuellt stodja hela ellerdelar av den adaptiva 
registreringen eller identifieringen av nytt formular resp registrerat formular om flera 
alternativa kandidater forekommer som sannolika enligt ett godhetstal. 
10 Slutligen kan formularets identitet faststalles genom dataf&ngst av ett RCG-varde. 

Vidare anger foreliggande uppfinning en anordning for utovande av ovanstiende 
forfarande. 

Anordningen genomfor automatisk dataf&ngst, via organ for detsamma, av 
formular med pk forhand okand utformning och informationsinnehail, vid inmatning i 
15 namnda organ, samt lagring av forlagor av densamma. Den lar sig adaptivt och registrerar 
utformningen av formular, och innefattar en dator med foljande organ for att &stadkomma 
den adaptiva registreringen: 

organ for alstring av en formularkarta utifrin det pi forhand okanda formularets 
utformning for identifiering av formular innehfillande information; 
20 organ for sokning med jamforelse av formularkartan med lagrade inlarda kartor i 

ett organ for lagring av formularkartor; 

organ for lagring av alstrad formularkarta i organet for lagring di den ej overens- 
stammer med en lagrad karta enligt forutbestamda granser for overensstammelse; 

organ for indikering av overensstammelse enligt granserna for overensstammelse 
25 di overensstammelse foreligger; och 

organ for identifiering och fortsatt datafingst av formularets informationsinnehail. 

Ytterligare kan anordningen innefatta eller utfora det som anges enligt forfarandet 
ovan i foreliggande uppfinning, vilket ytterligare belyses via bilagda osjalvstandiga 
anordningskrav. 

30 Kortfattad beskrivning av ritningsfigurer 

Fortsattningsvis hanvisas till bilagda figurer med tillhorande text for en battre 
forstielse av foreliggande uppfinning. 
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matiskt hur ett linjemdnster astadkomme! 



Fig. 1 illustrerar scnematiskt hur ett linjemdnster 4stadkommes ur en inskannad 
faktura. 

Fig. 2 illustrerar schematiskt ett flode for skanning, identifiering, tolkning, 
validering av ett formular enligt foreliggande uppfinning. 
5 Detaljerad beskrivning av foredragna utforingsformer 

I den fortsatta beskrivningen av foreliggande uppfinning forekommer formular 
som fakturor. Uppfinningen ar dock inte begransad till fakturor som formular, utan 
allehanda dokument innefattande text, figurer etc ar att betrakta som formular. Fakturor 
anvands har som ett exempel p4 ett formular for exemplifiering av uppfinningen. 
10 I fig. 1 illustreras schematiskt en del av en faktura 10 som skannats in i en dator 

och som visas p& datorns display. Som framgir av fakturan 10 ar den otydlig eller blurrig 
efter inskanningen eller avlasningen. 

Fakturan 10 bestir till en del av en logotyp 12 och olika vertikala och horisontella 
linjeelement 14. 

15 Att observera ar aven att logotypen inneh&ller linga svarta, eller fargade i 

skiftande kulorer, linjeelement 16 som delvis registreras i en linjekarta 18 enligt fore- 
liggande uppfinning och ger en uppfattning av hur den ursprungliga logotypen 12 sig ut, 
vilket underlattar nar fakturan ar fdrem&l for att identifieras i en formularkartsdatabas som 
registrerad. I fargade linjeelement innefattas aven giiskalor av svart. 

20 Formularkartan som har utgor en linjekarta 18 har filtrerats fr&n andra objekt 19, 

som exempelvis hela eller delar av textobjekt eller fargobjekt, som namnts aven lin- 
jeelement inefattande farg, vilka har inte kan iterges, men ett formular 10 kan innefatta 
m&nga fargfalt. 

En faktura 10 som bearbetas enligt foreliggande uppfinning, harefter benamnd EH 
25 (Eyes & Hands), m&ste i ett tidigt skede identifieras. For att en identifikation skall lyckas 
miste EH vid n&got tidigare tillfalle ha lart sig hur den aktuella fakturan 10 ser ut, vilket i 
korthet innebar att informationen om fakturan finns i en formulardatabas i EH. 

Identifikationen miste med nodvandighet vara snabb och kunna istadkommas mot 
en databas som innehiller ett mycket stort antal mojliga fakturaidentiteter 18, Det ar inte 
30 ovanligt att det forekommer mer an 10 000 identiteter 18 i databasen. 

Forfarandet och anordningen som EH anvander kraver inte att en faktura alltid 
dras igenom en skanner pd exakt samma satt, d v s information pi fakturan till&ts att 
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variera n&got i x- och y-lecnnom forutbestamda mitt eller troskelvSraen. I fig. 1 visas 
schematiskt ett kartesiskt koordinatsystem. 

I foreliggande uppfinning (EH) innefattas att EH i en utforingsform soker efter 
samtliga vertikala 14 och horisontella 15 linjeelement av en forutbestamd langd pk 
fakturan 10. Linjerna 14, 15 behover inte vara fristdende och isolerade, utan kan t ex 
vara en del i en storre logotext 12, ReadSoft AB i fig. 1. Logotypen 12 avspeglar sig som 
linjeelement 16 i linjekartan 18. 

De horisontella linjerna 15 och vertikala linjerna utgor grund for alstring av en 
horisontell nyckel (h-nyckel) resp en vertikal nyckel (v-nyckel) enligt foljande: 



* Fakturan indelas i ett stort antal horisontella segment langs y-axeln (ej visat). 
Varje segment motsvarar en position i h-nyckeln. Om ett visst segment inneh&ller 
ett eller flera linjeelement 15 satts ett marke (tagg) pi motsvarande nyckelpo- 
sition, i annat fall en lucka, ett inverterat marke etc, i stort nigot som skiljer sig 

15 frin market. 

* En v-nyckel for de vertikala linjeelementen 14 alstras pi liknande satt langs x- 
axeln. 

* H- och v-nyckeln benamnes och utgor tillsammans en linjenyckel. Harefter 
istadkommes en sokning, vilket innebar att aktuell linjenyckel jamfors med de , 

20 linjenycklar, for kanda fakturor 10, som finns i EH:s databas. Vid jamforelsen tas 

hansyn till att enstaka linjer eller linjeelement 14, 15 kan variera n&got i position, 
eller i vissa fall helt saknas, samt att det totala linjemonstret kan vara forskjutet 
nigot enligt lampliga forutbestamda varden i x- och y-led resp horisontellt och 
vertikalt. 

25 * Linjenycklarna i databasen ligger sorterade efter antalet marken (taggar), vilket 

ar till for att effektivisera sokningen. 

* Sokningen i sig resulterar i, ett pi forhand, antal troliga kandidater p& aktuell 
fakturaidentitet 10. SamUiga kandidater ar aven forknippade med ett godhetstal 
eller en sannolikhet for att de ar aktuell faktura 10. 

30 * Fakturans identitet faststalls slutligen genom att istadkomma en tolkning av det 

s k RCG-vardet (RCG-ReCoGnition). RCG-vardet ar ett varde, pi given position, 
som ar unikt for en viss faktura/leverantor eller annat formular. Exempel p4 
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s&dana varden ar bankgironummer, postgironummer, fakturanummer, totalsumma 
etc. 

Namnda segment kan exempelvis bilda rutmonster mer eller mindre finmaskiga 
5 efter behov relaterat till en snabb sokning. 

Linjenycklar kan aven implementeras p4 objekt i form av hel eller del av text och 
farger, vilka tilldelas linjenycklar ur en objektomr&deslista som innefattar x- och y-nycklar 
for objekt. Objektomr&deslistan kan exempelvis besti av positioner for vissa utvalda 
objekt. Principerna for linjekartor enligt ovan kan aven anammas for andra objekt an 
10 linjeelement for att &stadkomma identifiering av formular. 

Vidare om linjenycklar inte iterfinnes i databasen indikerar det att fakturan inte ar 
kand, vilket f&r till foljd att den nya linjenyckeln lagras i databasen som sMedes upp- 
dateras i realtid. 

Vid behov kan en operator manuellt via sin dator stodja hela eller del av adaptiv 
15 registrering och/eller identifiering av nytt formular resp registrerat formular om flera 
alternativa kandidater forekommer som sannolika enligt godhetstalet. 

Vidare innefattar foreliggande uppfinning en anordning for utovande av for- 
farandet enligt ovan. 

Anordningen utfor automatisk dataftngst, via organ for detsamma, av formular * 
20 med p4 forhand okand utformning och informationsinneh&ll, vid inmatning i namnda 
organ, samt lagring av forlagor av densamma. Den registrerar adaptivt och lar sig 
utformningen av formular, och innefattar en dator med foljande organ for att &stadkomma 
den adaptiva registreringen: 

organ for alstring av en formularkarta utifrin det p& forhand okanda formularets 
25 utformning for identifiering av formular innehillande information; 

organ for sokning med jamforelse av formularkartan med lagrade inlarda kartor i 
ett organ for lagring av formularkartor; 

organ for lagring av alstrad formularkarta i organet for lagring d& den ej overens- 
stammer med en lagrad karta enligt forutbestamda granser for overensstammelse; 
30 organ for indikering av overensstammelse enligt granserna for overensstammelse 

d& overensstammelse foreligger; och 

organ for identifiering och fortsatt datafingst av formularets informationsinnehaiL 
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itradesvis hird- och/eller programvarustyrda, sasom 

exempelvis: 

Skanner for dataf&ngst. 

Elektroniska lagringsmedium (h&rddisk, CD-skiva etc) for organet for lagring. 
5 Tecken, ikoner, signalgeneratorer etc som organ for indikering. 

Filter och jamfdrare for organ for sdkning och jamfdrelse samt filter och register 
for identifiering. 

Overhuvudtaget ar organen som utnyttjas i foreliggande uppfinning kanda for en 
fackman inom teknikomr&det, men deras samordning for att uppnd uppfinningens syften 
10 och andam&l ar daremot innovativ. 

I en utforingsform av foreliggande uppfinning med hanvisning till fig* 2 illustre- 
ras schematiskt ett flodesschema for skanning, identifiering, tolkning, validering av ett 
formular enligt foreliggande uppfinning. 

Fig. 2 ar indelad i prickade delomriden for att tydliggora de olika stegen i ett 
15 forfarande enligt foreliggande uppfinning, varvid stegen utgors av skanning av formular 
200, identifiering av formular 210, tolkning av formular 220 samt validering av formular 
230. 

Formularet skannas 200 in i EH, varefter identifiering 210 foljer. Identifieringen 
bestir av alstring av en linjekarta 212, alternativt en objektomrideslista, varvid en 

20 linjenyckel alstras. Darefter jamfors 214 formular 10 mot kanda nycklar i formularkarts- 
databasen, varvid en bekraftelse av identifieringen erh&lles via RCG-vardet. Nasta steg 
innefattar att avgora om identifieringen lyckades 216 enligt villkoret H Ja M eller "Nej". Om 
avgorandet resulterar i "Nej" undersoks villkorligt om det finns fler kandidater i form av 
linjenycklar 218. Om svaret ar "Ja" genomlopes en slinga i form av 214, 216, 218 anda 

25 tills lyckad identifiering istadkommes 216 eller att det inte forekommer fler linjenyckel- 
kandidater 218. 

I det fall att identifieringen lyckas vidtar tolkningen 220 av formularet genom 
tolkning med hjalp av aktuell formularkarta 222, varefter validering 230 eller utvardering 
av formulars 10 fait sker 232. Som en option kan en operator hjalpa till med urvalet om 
30 flera alternativa fait iterfinnes 234. 

I fall att identifiering 210 misslyckas och att det inte forekommer fler linjenyckel- 
kandidater 218 utfors tolkning 220 genom att sjalvupplarning med en formulardefinition 

P33354SE00AJDO/YA 



PCT/SE9 8 / 0 0 6 



9 



^ 0 1 -04- 1998 



224 istadkommes. Formulardefinitionen bestir av en mall eller ett regelverk som 
beskriver det som ar gemensamt for en specifik samling av formular, exempelvis svenska 
fakturor. Darefter tolkas RCG-vardet 226 och det avgors 228 om aktuellt RCG-varde 
iterfinnes i formularkartsdatabasen. Om svaret ar "Ja" vidtar en omtolkning 229, varefter 
tolkningen fortsatter vid 222 for att leda till valideringen 232. 

Om daremot svaret ar "Nej" vidtar validering 230, 236, varefter formularet 
sparas i formularkartsdatabasen med linjenyckeln 238. Innan stegen 236, 238 genomfores 
kan aven en operator istadkomma upplarning 239, om flera alternativa fait iterfinnes. 

Ovan beskrivna utforingsformer av foreliggande uppfinning ar mqjliga sidana, 
men inte amnade att begransa uppfinningen som sidan, utan for fackmannen inom 
teknikomr&det framgir ytterligare utforingsformer via avfattningen av bilagda patentkrav. 
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Patentkrav 



1. Forfarande for automatisk datafingst (200), via organ for detsamma, av 
formular (10) med pi forhand okand utformning och informationsinnehill (19), vid 
inmatning i namnda organ, samt lagring av forlagor av densamma, kannetecknat 

5 av att forfarandet ar adaptivt (224) och innefattar inlarning och registrering av utform- 
ningen av formular, varvid det innefattar foljande steg for att istadkomma adaptiv 
registrering (238): 

alstring av en formularkarta (18) utifrin det pi forhand okanda formularets (10) 
utformning for identifiering (210) av formular innehillande information; 
10 sokning med jamforelse (210, 220) av formularkartan (18) med lagrade inlarda 

kartor i ett organ for lagring av formularkartor; 

lagring (238) av alstrad formularkarta (18) i organet for lagring di den ej 
overensstammer med en lagrad karta enligt forutbestamda granser for overensstammelse; 
indikering av overensstammelse enligt granserna for overensstammelse di 
15 overensstammelse foreligger; och 

fortsatt datafingst (232) for identifiering av formularets informationsinnehill. 

2. Forfarande enligt krav 1, kannetecknat avatt formularkartan (18) 
bestir av en objektomrideslista med objekt (19) innefattade i formular. 

3. Forfarande enligt krav 2, kannetecknat avatt objekten (19) bestir 
20 av farger och/eller hel eller del av text. 

4. Forfarande enligt krav 1, kannetecknat avatt formularkartan (18) 
utgor en linjekarta innefattande linjeelement (14, 15) ur formularet (10). 

5. Forfarande enligt krav 4, kannetecknat avatt horisontella linjer 
(15) i linjekartan anvands for alstring av en horisontell nyckel genom att formularet 

25 indelas i ett forutbestamt antal horisontella segment langs y-axeln i ett kartesiskt koordi- 
natsystem, varvid vaije segment motsvarar en position i den horisontella nyckeln. 

6. Forfarande enligt krav 4 och 5, kannetecknat av att vertikala 
linjer (14) i linjekartan anvands for alstring av en vertikal nyckel genom att formularet 
(10) indelas i ett forutbestamt antal vertikala segment langs x-axeln i ett kartesiskt 

30 koordinatsystem, varvid varje segment motsvarar en position i den vertikala nyckeln. 

7. Forfarande enligt krav 5 och 6, kannetecknat avatt minst ett 
linjeelement (14, 15) som innefattas i ett segment marks i motsvarande nyckelposition och 
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att segment vilka saknar linjeelement forblir omarkta i motsvarande nyckelposition. 

8. Fdrfarande enligt krav 4-7, kannetecknat avatten horisontell 
nyckel och/eller en vertikal nyckel utgor en linjenyckel i linjekartan (18), varvid, vid 
namnda sokning, alstrad linjenyckel jamfors (214) med linjenycklar som finns lagrade i 

5 organet for lagring for kontroll av overensstammelse. 

9. Forfarande enligt krav 8, kannetecknat av att linjenycklarna ar 
sorterade efter antalet markningar i organet for lagring. 

10. Forfarande enligt krav 1 och 2, kannetecknat av att objekts 
horisontella position i objektomrideslistan anvands for alstring av en horisontell nyckel 

10 genom att formularet indelas i ett forutbestamt antal horisontella segment langs y-axeln i 
ett kartesiskt koordinatsystem, varvid varje segment motsvarar en position i den horison- 
tella nyckeln. 

1 1 . Forfarande enligt krav 1 och 2, kannetecknat avatt objekts 
vertikala position i objektomrideslistan anvands for alstring av en vertikal nyckel genom 

15 att formularet indelas i ett forutbestamt antal vertikala segment langs x-axeln i ett 

kartesiskt koordinatsystem, varvid varje segment motsvarar en position i den vertikala 
nyckeln. 

12. Forfarande enligt krav 10 och 11, kannetecknat avatt minst ett 
objekt som innefattas i ett segment marks i motsvarande nyckelposition och att segment 

20 vilka saknar objekt forblir omarkta i motsvarande nyckelposition. 

13. Forfarande enligt krav 10-12, kannetecknat av att en horisontell 
nyckel och/eller en vertikal nyckel utgor en objektnyckel i objektomrideslistan, varvid, 
vid namnda sokning, alstrad objektnyckel jamfors med objektnycklar som finns lagrade i 
organet for lagring for kontroll av overensstammelse. 

25 14. Forfarande enligt krav 15, kannetecknat avatt objektnycklarna 

ar sorterade efter antalet markningar i organet for lagring. 

15. Forfarande enligt krav 1-14, kannetecknat avatt sokningen 
(210, 218) istadkommes att resultera i ett antal pi forhand begarda troliga kandidater for 
aktuellt sokt formular. 

30 16. Forfarande enligt krav 15, kannetecknat av att en operator 

manuellt (234, 239) stoder hela eller delar av den adaptiva registreringen eller identifie- 
ringen av nytt formular resp registrerat formular om flera alternativa kandidater fore- 



P33354SE00/UDO/YA 



PCT/SE9 8 /O'joui 



12 




0 1 -04- 1993 



kommer som sannolika enligt ett godhetstal. 

17. Forfarande enligt krav 1-16, kannetecknat avatt formularets 
identitet faststalles genom datafangst av RCG-vardet (214). 

18. Anordning for automatisk datafangst, via organ for detsamma, av formular 
5 (10) med pa forhand okand utformning och informationsinnehall (19), vid inmatning i 

namnda organ, samt lagring av forlagor av densamma, kannetecknad avatt 
den adaptivt lar sig och registrerar utformningen av formular (10), och innefattar en dator 
med foljande organ for att astadkomma den adaptiva registreringen (238): 

organ for alstring av en formularkarta (18) utifran det pa forhand okanda formula- 
10 rets (10) utformning for identifiering av formular innehallande information; 

organ for sokning med jamforelse av formularkartan med lagrade inlarda kartor i 
ett organ for lagring av formularkartor; 

organ for lagring av alstrad formularkarta (18) i organet for lagring da den ej 
overensstammer med en lagrad karta enligt forutbestamda granser for overensstammelse; 
15 organ for indikering av overensstammelse enligt granserna for overensstammelse 

da overensstammelse foreligger; och 

organ for identifiering och fortsatt datafangst av formularets informationsinnehall. 

19. Anordning enligt krav 18, kannetecknad av att formularkartan 
(18) bestar av en objektomradeslista med objekt (19) innefattade i formular (10). 

20 20. Anordning enligt krav 19, kannetecknad av att objekten (19) 

bestar av farger och/eller hel eller del av text. 

21. Anordning enligt krav 18, kannetecknad av att formularkartan 
(18) utgor en linjekarta innefattande linjeelement (14, 15) ur formularet. 

22. Anordning enligt krav 21, kannetecknad avatt horisontella linjer 
25 (15) i linjekartan anvands for alstring av en horisontell nyckel genom att formularet 

indelas i ett fdrutbestamt antal horisontella segment langs y-axeln i ett kartesiskt koordi- 
natsystem, varvid varje segment motsvarar en position i den horisontella nyckeln. 

23. Anordning enligt krav 21 och 22, kannetecknad av att vertikala 
linjer (14) i linjekartan anvands for alstring av en vertikal nyckel genom att formularet 

30 indelas i ett fdrutbestamt antal vertikala segment langs x-axeln i ett kartesiskt koordinat- 
system, varvid varje segment motsvarar en position i den vertikala nyckeln. 

24. Anordning enligt krav 22 och 23, kannetecknad av att minst ett 
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linjeelement (14, 15) som^mnefattas i ett segment marks i motsvarande nyckelposition och 
att segment vilka saknar linjeelement forblir omarkta i motsvarande nyckelposition. 

25. Anordning enligt krav 22-24, kannetecknad avatten horisontell 
nyckel och/eller en vertikal nyckel utgor en linjenyckel i linjekartan, varvid, vid namnda 

5 sokning, alstrad linjenyckel jamfors med linjenycklar som finns lagrade i organet for 
lagring for kontroll av overensstammelse. 

26. Anordning enligt krav 25, kannetecknad av att linjenycklarna ar 
sorterade efter antalet markningar i organet for lagring. 

27. Anordning enligt krav 18 och 19, kannetecknad av att objekts 
10 (19) horisontella position i objektomr&deslistan anvands for alstring av en horisontell 

nyckel genom att formularet indelas i ett forutbestamt antal horisontella segment langs y- 
axeln i ett kartesiskt koordinatsystem, varvid varje segment motsvarar en position i den 
horisontella nyckeln. 

28. Anordning enligt krav 18 och 19, kannetecknad av att objekts 
15 (19) vertikala position i objektomrideslistan anvands for alstring av en vertikal nyckel 

genom att formularet indelas i ett forutbestamt antal vertikala segment langs x-axeln i ett 
kartesiskt koordinatsystem, varvid varje segment motsvarar en position i den vertikala 
nyckeln. 

29. Anordning enligt krav 27 och 28, kannetecknad av att minst ett 
20 objekt som innefattas i ett segment marks i motsvarande nyckelposition och att segment 

vilka saknar objekt forblir omarkta i motsvarande nyckelposition. 

30. Anordning enligt krav 27-29, kannetecknad av att en horisontell 
nyckel och/eller en vertikal nyckel utgor en objektnyckel i objektomrideslistan, varvid, 
vid namnda sokning, alstrad objektnyckel jamfors med objektnycklar som finns lagrade i 

25 organet for lagring for kontroll av overensstammelse. 

31. Anordning enligt krav 30, kannetecknad avatt objektnycklarna 
ar sorterade efter antalet markningar i organet for lagring. 

32. Anordning enligt krav 18-31, kannetecknad av att sokningen 
istadkommes att resultera i ett antal pi forhand begarda troliga kandidater for aktuellt sokt 

30 formular. 

33. Anordning enligt krav 15, kannetecknad av att en operator via 
anordningen manuellt kan stodja hela eller delar av den adaptiva registreringen eller 
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identifieringen av nytt formular resp registrerat formular om flera alternativa kandidater 
forekommer som sannolika enligt ett godhetstal. 

34. Anordning enligt krav 18-33, kannetecknat avatt formularets 
identitet faststalles genom dataf&ngst av RCG-vardet. 
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Sammandrag 

Uppfinningen anger ett forfarande och en anordning for automatisk datafiLngst av 
pappersburen information, varvid ett okant formular (10) som skannas in i en dator 
omvandlas till linjeelement (14, 15) for snabbare identifiering. Identifiering av kanda 
blanketter (10) underlattas aven via linjeelementen (14, 15). 
(Fig. 1) 
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Scanna formular 



200 



± 



Gflr linjckarta 
GOr linjcayckel 
(GOr oejekaoaradeslisa) 



212 



210 



Idencifiera formular genom att jlmfora 
mot klnda nydtlar i forraullrkartsdaubuen 
Bekrtfta zenom nt to flu RCG-vtrdet 



214 



216 



218 



A LyckadM N 

\ identifiering / 



Nej 



Ja 



Fmns fler 

linjenyekel- 

kandidater 



m. 



Tolka fonnuilrct m hj a 
aktueli foanuttriorta 



222 



224 I G<Jr Jj^^PP 1 *™ 0 ^ 

med hjiip av fwmullrdcfaitioaea 



226 



Toflca RCG-vflrdet 



229 



omtoQcoing 




Finns aktuellt RGC-vflrde 
t fonnuiaricartsdatabasen 



> 



220 
228 



Nej 



234 



239 



Operator hjilper med urval, 
om flen alternativa fiUt hhtas 



Operator hjilper uppllmingen, om 
flera flera altemativa fait hhtas 



Validering 



232 



236 



Validering 



230 



238 



Span fbrmuUret i ConnuUrkartsdatabi 
(inkl linjenyckel) 
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£X AP ARIPO Patent: GH Ghana. GM Gambia, KJE Kenya. LS Lesotho. MW Malawi. SD Sudan. SZ Swaziland, UG Uganda. 

^ ZW Zimbabwe, and any other State which is a Contracting State of the Harare Protocol and of the PCT 

£j) EA Eurasian Patent: AM .Armenia. AZ Azerbaijan. BY Belarus. KG Kyrgyzstan. KZ Kazakhstan, MD Republic of 

Moldova. RU Russian Federation. TJ Tajikistan. TM Turkmenistan, and any other State which is a Contracting State 

of the Eurasian Patent Convention and of the PCT 
EP European Patent: AT Austria. BE Belgium. CH and LI Switzerland and Liechtenstein. DE Germany, DK Denmark. 

ES Spain. FI Finland. FR France. GB United Kingdom. GR Greece. EE Ireland. IT Italy. LU Luxembourg. MC Monaco. 

NL Netherlands, PT Portugal. SE Sweden, and any other State which is a Contracting State of the European Patent 

Convention and of the PCT 

^ OA OAPI Patent: BF Burkina Faso. BJ Benin. CF Central African Republic. CG Congo. CI Cote d'Woire. CM Cameroon. 

GA Gabon. GN Guinea. ML Mali. MR Mauritania. NE Niger. SN Senegal. TD Chad. TG Togo, and any other State 
which is a member State of OAPI and a Contracting State of the PCT (if other kind of protection or treatment desired, specify 
on dotted line) 

National Patent (if other kind of protection or treatment desired, specify on dotted line): 

LT Lithuania 
LU Luxembourg 
LV Latvia 

MD Republic of Moldova 

MG Madagascar 
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Albania 

Armenia 

Austria 

Australia 

Azerbaijan 

Bosnia and Herzegovina 

Barbados 

Bulgaria 

Brazil 

Belarus 
Canada 

CH and LI Switzerland and Liechtenstein 

CN China 

Cuba 

Czech Republic 

Germany 

Denmark 

Estonia 

Spain 

Finland 

United Kingdom 

Georgia 

Ghana 

GM Gambia 

GW Guinea-Bissau 
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ICE Kenya 

KG Kyrgyzstan 

KP Democratic People's Republic of Korea 
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MK The former Yugoslav Republic of Macedonia 
MN Mongolia 

MW Malawi 
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NO Norway 

NZ New Zealand 

PL Poland 

PT Portugal 

RO Romania 

RU Russian Federation 

SD Sudan 
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SI Slovenia 

SK Slovakia 

SL Sierra Leone 

TJ Tajikistan 

TM Turkmenistan 

TR Turkey 

TT Trinidad and Tobago 

UA Ukraine 

UG Uganda 
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Sri Lanka 
Liberia 
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Check-boxes reserved for designating States (for the purposes of 
a national patent) which have become party to the PCT after 
issuance of this sheet: 
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UZ 
VN 
YU 
ZW 



Uzbekistan 
Viet Nam . 
Yugoslavia 
Zimbabwe . 



In addition to the designations made above, the applicant also makes under Rule 4.9(b) all designations which would be pe: 
under the PCT except the designation(s) of 



itted 



The applicant declares that those additional designations are subject to confirmation and that any designation which is not connrmed 
before the expiration of 1 5 months from thcpriority date is to be regarded as withdrawn by the applicant at the expiration of that ume 
limit, f Confirmation of a designation consists of the filing of a notice specifying that designation and the payment of the designation and confirmation 
fees. Confirmation must reach the receiving Office within the 1 5 -month time limit. ) ) 
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"P" document published prior to the international filing date but later than 
the priority date claimed 



later document published after the international filing dale or priority 
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document of particular relevance: the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive 
step when the document is taken alone 

document of particular relevance: the claimed invention cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 
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1. The designated Office is hereby notified of its election made: 

| X | in the demand filed with the International Preliminary Examining Authority on: 

29 October 1998 (29.10.98) 

| | in a notice effecting later election filed with the International Bureau on: 



2. The election | X | was 

| | was not 

made before the expiration of 19 months from the priority date or, where Rule 32 applies, within the time limit under 
Rule 32.2(b). 





Authorized officer 


The International Bureau of W1PO 
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P33354PC00 


™o mim-ru™ A<-rir>M See Notification of Transmittal of International 
FOR FURTHER ACTION preliminary Examinat ion Report (Form PCT/IPEA/416> 


International application No. 
PCT/SE98/00602 


International filing date (day/month/year) 
01.04.1998 


Priority date (day/month/year) 
01.04.1997 


International Patent Classification (IPC) or national classification and WC 6 
G06K 9/20 // G06F 17/24 


Applicant 

ReadSoft AB et al. 



This international preliminary examination report has been prepared by this International Preliminary Examining 
Authority and is transmitted to the applicant according to Article 36. 



2. This REPORT consists of a total of _3_ 



sheets, including this cover sheet. 



This report is also accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 



These annexes consist of a total of 



sheets. 
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Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



Date of submission of the demand I 
29.10.1998 
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Authorized officer 
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INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



InternatidTEff application No. 
PCT/SE98/00602 



I. Basis of the report 



1 This report has been drawn on the basis of (Replacement sheets which have been furnished to the receiving Office m response to an invitation 
under Article 14 are referred to in this report as "originally fled" and are not annexed to the report since they do not contain amendments,): 

| [ the international application as originally filed; 



the description, 


pages 1-9 


, as originally filed, 




naees 


, filed with the demand, 




naees ! 


, filed with the letter of 




Daces 


, fileH with the letter of 


the claims, 


Nos, 


, as- originally fileoV 




Nos. 


, as amended under Article 19, 




Nos. 


, filed with the demand. 




Nos. :l-34 


, *M with the letter of 14.04-199-9 




Nos. 


. filed with the letter of 


the drawings, 


1. 

sheets/fig 1—2 


, as-originally filed, 


shoots/fig 


, filed with the demand 




sheets/fie. 


. filed with thft ^tter of 




sheets/fie 


, filed with the letter of 



2. The amendments have resulted in the cancellation ofc 
| | the description, pages 

[ j the claims, , Nos. * 



[ [ the drawings/ sheete/fig 



3 I — I This report has been established as if (some of) the amendments had not been made; since they have-been considered to 
* I — * go beyond the disclosure as fileoV as indicated in the supplemental Bo* (Rule 70,2(c». 



4. Additional observations, if necessary: 



1 
1 
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V. Resoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



1 . Statement 



f 

Novelty (N) £ Claims 1-34 , YES 

f Claims . NO 

Inventive step (IS) | Claims 1-34 YES 



Claims 



NO 



Industrial applicability (IA) £ Claims 1-34 . YES 

? Claims : NO 



2. Citations and explanations £ 

The claimed invention relates to a method and a device for 
automatic data acquisition of forms where the system has no 
prior knowledge about what the form looks like. 

In the International Search Report, the following documents 
were cited: 1 f 

Dl: US, A, 4933979 l : 

D2: US, A, 5140650 . 

\ 

D3: US, A, 5293429 \ 

Dl describes > a data reading apparatus which registers form 
information for each kind of form sheet from which the data 
are to be read (see- column 9, line 51 - column 10, line 52) . 

D2 shows a method for automatic extraction of data from 
printed forms. A tblank master form is first scanned and 
stored. 

In D3, a system and a- method for automatically classifying 
forms is described.-: 

The described invention differs from prior art according to 
the cited documents in that no initial scanning of a blank 
form sheet is necessary. This difference cannot be considered 
as obvious to a person skilled in the art. The subject matter 
of claims 1-34 is therefore novel and inventive. 

Consequently, the subject matter of claims 1-34 is considered 
to fulfil the requirements of novelty, inventive step and 
industrial applicability. 
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1. Method for the automatic data acquisition (200), by means of a means for the 
same, of forms (10) whose design and information content (19) is not known in advance, by input 
into the said means, together with storage of patterns of the same, characterised in that 

5 the method is adaptive (224) and includes self-learning and registration of the design of forms 
without initial scanning of a blank form sheet being necessary, whereby it includes the following 
steps to accomplish the adaptive registration (238): 

generation oft a form map (18) based on the previously unknown form's (10) design 
for identifying (210) information contained on the form; 
10 searching and comparing (210, 220) the form map (18) with stored, registered maps 

in a means for storing form maps; 

storage (23 8)/ of generated form maps (18) in the storage means when they do not 
coincide with a stored map according to pre-determined limits for agreement; 

indication of;agreement according to the limits for agreement when agreement is 

15 found; and 

continued data acquisition (232) for identifying the information content of the form. 

2. Method according to claim 1, characterised in that the form map (18) 
consists of an object area list with objects (19) contained in the form. 

3. Method according to claim 2, characterised in that the object (19) 
20 comprises colours and/or wholly or partly of text. 

4. Method according to claim 1, characterised in that the form map (18) 
constitutes a line map comprising line elements (14,15) from the form (10). 

5. Method according to claim 4, characterised in that the horizontal lines 
(15) in the line map are used to generate a horizontal key by dividing the form into a pre- 

25 determined number of horizontal segments along the y-axis in a cartographic system of co- 
ordinates, whereby each segment is equivalent to a position in the horizontal key. 

6. Method according to claims 4 and 5, characterised in that the vertical 
lines (14) in the line map are used to generate a vertical key by dividing the form (10) into a pre- 
determined number of vertical segments along the x-axis in a cartographic system of co- 

30 ordinates, whereby each segment is equivalent to a position in the vertical key. 
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7. Method according to claims 5 and 6, characterised in that at least one 
line element (14; 15) that is included in a segment is marked in the equivalent key position, and 
that segments that lack line elements remain unmarked in the equivalent key position. 

8. Method according to claims 4-7, characterised in that the horizontal key 
5 and/or a vertical key constitute a line key in the line map (18), whereby, during the said 

searching, the line key generated is compared (214) with line keys stored in the storage means for 
verifying agreement. f- 

9. Method According to claim 8, characterised in that the line keys are 
sorted in the storage means according to the number of markings. 

10 10. Method ^according to claims 1 and 2, characterised in that the object's 

horizontal position in theiobject area list is used to generate a horizontal key by dividing the form 
into a pre-determined number of horizontal segments along the y-axis in a cartographic system of 
co-ordinates, whereby each segment is equivalent to a position in the horizontal key. 

1 1 . Method ^according to claims 1 and 2, characterised in that the object's 
15 vertical position in the object area list is used to generate a vertical key by dividing the form into 

a pre-determined number, of vertical segments along the x-axis in a cartographic system of co- 
ordinates, whereby each segment is equivalent to a position in the vertical key. 

i 

12. Method [according to claims 10 and 11, characterised in that at least 
one object that is included in a segment is marked in the equivalent key position, and that 

20 segments that lack objects remain unmarked in the equivalent key position 

13. Method [according to claims 10-12, characterised in that a horizontal 

key and/or a vertical key Iconstitute an object key in the object area list, whereby, during the said 

J 

searching, the object key igenerated is compared with object keys stored in the storage means for 
verifying agreement. 

25 14. Method according to claim 13, characterised in that the object keys are 

sorted in the storage means according to the number of markings. 

15. Method according to claims 1-14, characterised in that the searching 

(210, 218) results in a pre-defined number of requested probable candidates for the currently 

searched form. » 
30 16. Method according to claim 15, characterised in that an operator can 

support manually (234, 239) the whole or parts of the adaptive registration or identification of the 
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new form or registered forms respectively if several alternative candidates are found as 
probabilities according to a factor of merit. 

17. Method according to claims 1-16, characterised in that the identity of 
the form is confirmed by the data acquisition of a ReCoGnition (RCG) value (214) which 

5 uniquely identifies a form. 

18. Arrangement for the automatic data acquisition, by means of a means for the 
same, of forms (10) whose design and information content (19) is not known in advance, by input 
into the said means together with storage of patterns of the same, characterised in that it 
learns adaptively and registers the design of forms (10) without initial scanning of a blank form 

10 sheet being necessary, and includes a computer with the following means for carrying out the 
adaptive registration (238): 

means for generating a form map (18) based on the previously unknown form's (10) 
design for identifying information contained on the form; 

means for searching and comparing the form map with stored, recognised maps in a 
15 means for storing form maps; 

means for storage of generated form maps (18) in the storage means when they do 
not coincide with a stored map according to pre-determined limits for agreement; 

means for indicating agreement according to the limits for agreement when 
agreement is found; and 

20 means for identification and continued data acquisition of the information content of 

the form. 

19. Arrangement according to claim 18 characterised in that the form map 
(18) consists of an object area list with objects (19) contained in the form (10). 

20. Arrangement according to claim 19, characterised in that the object 
25 (19) comprises colours and/or wholly or partly of text. 

21 . Arrangement according to claim 18, characterised in that the form map 
(18) constitutes a line map comprising line elements (14, 15) from the form. 

22. Arrangement according to claim 21, characterised in that the 
horizontal lines (15) in the line map are used to generate a horizontal key by dividing the form 

30 into a pre-determined number of horizontal segments along the y-axis in a cartographic system of 
co-ordinates, whereby each segment is equivalent to a position in the horizontal key. 
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23. Arrangement according to claims 2 1 and 22, characterised in that the 
vertical lines (14) in the line map are used to generate a vertical key by dividing the form into a 
pre-determined number of vertical segments along the x-axis in a cartographic system of co- 
ordinates, whereby each segment is equivalent to a position in the vertical key. 
5 24. Arrangement according to claims 22 and 23, characterised in that at 

least one line element (14, 15) that is included in a segment is marked in the equivalent key 
position, and that segments that lack line elements remain unmarked in the equivalent key 
position. 

25. Arrangement according to claims 22-24, characterised in that the 
10 horizontal key and/or a vertical key constitute a line key in the line map, whereby, during the 

said searching, the line key generated is compared with line keys stored in the storage means for 
verifying agreement. 

26. Arrangement according to claim 25, characterised in that the line keys 
are sorted in the storage means according to the number of markings. 

15 27. Arrangement according to claims 18 and 19, characterised in that the 

object's (19) horizontal position in the object area list is used to generate a horizontal key by 
dividing the form into a pre-determined number of horizontal segments along the y-axis in a 
cartographic system of co-ordinates, whereby each segment is equivalent to a position in the 
horizontal key. 

20 28. Arrangement according to claims 18 and 19, characterised in that the 

object's (19) vertical position in the object area list is used to generate a vertical key by dividing 
the form into a pre-determined number of vertical segments along the x-axis in a cartographic 
system of co-ordinates, whereby each segment is equivalent to a position in the vertical key. 

29. Arrangement according to claims 27 and 28, characterised in that at 
25 least one object that is included in a segment is marked in the equivalent key position, and that 

segments that lack objects remain unmarked in the equivalent key position. 

30. Arrangement according to claims 27-29, characterised in that a 
horizontal key and/or a vertical key constitute an object key in the object area list, whereby, 
during the said searching, the object key generated is compared with object keys stored in the 

30 storage means for verifying agreement. 

3 1 . Arrangement according to claim 30, characterised in that the object 
keys are sorted in the storage means according to the number of markings. 
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32. Arrangement according to claims 18-31, characterised in that the 



searching results in a pre-defined number of requested probable candidates for the currently 
searched form. 

33. Arrangement according to claim 32, characterised in that an operator 
5 can support manually the whole or parts of the adaptive registration or identification of the new 

form or registered forms respectively if several alternative candidates are found as probabilities 
according to a factor of merit. 

34. Arrangement according to claims 18-33, characterised in that the 
identity of the form is confirmed by the data acquisition of a a ReCoGnition (RCG) value (214) 

10 which uniquely identifies a form. 
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